Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 459347 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 314.9 MiB |
| Average record size in memory | 718.7 B |
Variable types
| Numeric | 6 |
|---|---|
| Text | 5 |
| Categorical | 5 |
| DateTime | 1 |
Alerts
| mmr is highly overall correlated with odometer and 2 other fields | High correlation |
|---|---|
| odometer is highly overall correlated with mmr and 2 other fields | High correlation |
| sellingprice is highly overall correlated with mmr and 2 other fields | High correlation |
| year is highly overall correlated with mmr and 2 other fields | High correlation |
| body is highly imbalanced (52.2%) | Imbalance |
| transmission is highly imbalanced (78.5%) | Imbalance |
| interior is highly imbalanced (51.4%) | Imbalance |
| Unnamed: 0 has unique values | Unique |
| vin has unique values | Unique |
Reproduction
| Analysis started | 2024-11-14 01:41:09.016466 |
|---|---|
| Analysis finished | 2024-11-14 01:41:22.315089 |
| Duration | 13.3 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
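The reported duration can be checked against the two timestamps. A minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-11-14 01:41:09.016466", FMT)
finished = datetime.strptime("2024-11-14 01:41:22.315089", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```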
Unnamed: 0
Real number (ℝ)
UNIQUE 
| Distinct | 459347 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 284634.94 |

| Minimum | 0 |
|---|---|
| Maximum | 558836 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 37951.6 |
| Q1 | 148497.5 |
| median | 284795 |
| Q3 | 421203.5 |
| 95-th percentile | 531202.7 |
| Maximum | 558836 |
| Range | 558836 |
| Interquartile range (IQR) | 272706 |
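The range and interquartile range rows follow directly from the other quantiles. A small check, using the values from the table above; the variable names are illustrative.

```python
# Quantiles copied from the "Unnamed: 0" quantile table.
minimum, q1, median, q3, maximum = 0, 148497.5, 284795, 421203.5, 558836

# Range is the spread of the whole column; IQR is the spread of the middle 50%.
value_range = maximum - minimum
iqr = q3 - q1

print("Range:", value_range)
print("IQR:", iqr)
```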
Descriptive statistics
| Standard deviation | 158181.92 |
|---|---|
| Coefficient of variation (CV) | 0.55573613 |
| Kurtosis | -1.1767618 |
| Mean | 284634.94 |
| Median Absolute Deviation (MAD) | 136354 |
| Skewness | -0.013047283 |
| Sum | 1.3074621 × 10¹¹ |
| Variance | 2.502152 × 10¹⁰ |
| Monotonicity | Strictly increasing |
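Several rows in this table are related by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below recomputes them from the rounded values shown in the report, so small differences in the last digits are expected.

```python
# Values copied from the "Unnamed: 0" tables in this report.
n_observations = 459347
mean = 284634.94
std_dev = 158181.92

coefficient_of_variation = std_dev / mean   # CV = standard deviation / mean
variance = std_dev ** 2                     # variance = standard deviation squared
total = mean * n_observations               # sum = mean * number of observations

print(f"CV       = {coefficient_of_variation:.6f}")
print(f"Variance = {variance:.6e}")
print(f"Sum      = {total:.6e}")
```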
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 375652 | 1 | < 0.1% |
| 375474 | 1 | < 0.1% |
| 375473 | 1 | < 0.1% |
| 375472 | 1 | < 0.1% |
| 375471 | 1 | < 0.1% |
| 375470 | 1 | < 0.1% |
| 375469 | 1 | < 0.1% |
| 375468 | 1 | < 0.1% |
| 375467 | 1 | < 0.1% |
| Other values (459337) | 459337 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 558836 | 1 | < 0.1% |
| 558835 | 1 | < 0.1% |
| 558834 | 1 | < 0.1% |
| 558833 | 1 | < 0.1% |
| 558831 | 1 | < 0.1% |
| 558828 | 1 | < 0.1% |
| 558827 | 1 | < 0.1% |
| 558826 | 1 | < 0.1% |
| 558825 | 1 | < 0.1% |
| 558824 | 1 | < 0.1% |
year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.2225 |

| Minimum | 1990 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 1990 |
|---|---|
| 5-th percentile | 2003 |
| Q1 | 2008 |
| median | 2012 |
| Q3 | 2013 |
| 95-th percentile | 2014 |
| Maximum | 2015 |
| Range | 25 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.8341173 |
|---|---|
| Coefficient of variation (CV) | 0.0019073099 |
| Kurtosis | 1.0581366 |
| Mean | 2010.2225 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.215319 |
| Sum | 9.2338968 × 10⁸ |
| Variance | 14.700456 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 85553 | 18.6% |
| 2012 | 85176 | 18.5% |
| 2014 | 68540 | 14.9% |
| 2011 | 39984 | 8.7% |
| 2008 | 25853 | 5.6% |
| 2007 | 24351 | 5.3% |
| 2010 | 21693 | 4.7% |
| 2006 | 20809 | 4.5% |
| 2009 | 17178 | 3.7% |
| 2005 | 16637 | 3.6% |
| Other values (16) | 53573 | 11.7% |
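For numeric columns such as year, the Frequency (%) column is the count divided by the number of observations (459347). The sketch below recomputes it for a few rows of the table above; `frequency_pct` is a hypothetical helper name, not part of any library.

```python
N_OBSERVATIONS = 459347  # from the Dataset statistics table

def frequency_pct(count, total=N_OBSERVATIONS):
    """Return the share of `count` in `total` as a percentage."""
    return 100 * count / total

# Counts copied from the year table.
year_counts = {2013: 85553, 2012: 85176, 2014: 68540, 2011: 39984, 2008: 25853, 2007: 24351}

formatted = {year: f"{frequency_pct(count):.1f}%" for year, count in year_counts.items()}
for year, pct in formatted.items():
    print(year, pct)
```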
| Value | Count | Frequency (%) |
| 1990 | 33 | < 0.1% |
| 1991 | 51 | < 0.1% |
| 1992 | 95 | < 0.1% |
| 1993 | 127 | < 0.1% |
| 1994 | 286 | 0.1% |
| 1995 | 477 | 0.1% |
| 1996 | 562 | 0.1% |
| 1997 | 1030 | 0.2% |
| 1998 | 1446 | 0.3% |
| 1999 | 2187 | 0.5% |
| Value | Count | Frequency (%) |
| 2015 | 7904 | 1.7% |
| 2014 | 68540 | 14.9% |
| 2013 | 85553 | 18.6% |
| 2012 | 85176 | 18.5% |
| 2011 | 39984 | 8.7% |
| 2010 | 21693 | 4.7% |
| 2009 | 17178 | 3.7% |
| 2008 | 25853 | 5.6% |
| 2007 | 24351 | 5.3% |
| 2006 | 20809 | 4.5% |
make
Text
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 5.9949951 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2753783 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
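The mean length in the Length table agrees with the total character count divided by the number of observations. A short check using the totals reported for the text columns in this report; the dictionary name is illustrative.

```python
N_OBSERVATIONS = 459347  # from the Dataset statistics table

# "Total characters" values copied from the per-variable tables.
total_characters = {
    "make": 2753783,
    "model": 3101311,
    "trim": 2131417,
    "vin": 7808899,
}

# Mean string length per column = total characters / number of rows.
mean_length = {name: chars / N_OBSERVATIONS for name, chars in total_characters.items()}
for name, value in mean_length.items():
    print(f"{name}: {value:.7f}")
```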
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | KIA |
|---|---|
| 2nd row | KIA |
| 3rd row | BMW |
| 4th row | VOLVO |
| 5th row | BMW |
| Value | Count | Frequency (%) |
| ford | 78858 | |
| chevrolet | 52580 | 11.4% |
| nissan | 43128 | 9.4% |
| toyota | 34463 | 7.5% |
| dodge | 26364 | 5.7% |
| honda | 24165 | 5.2% |
| hyundai | 18286 | 4.0% |
| bmw | 16880 | 3.7% |
| kia | 15541 | 3.4% |
| chrysler | 14758 | 3.2% |
| Other values (45) | 135610 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 280617 | 10.2% |
| E | 250663 | 9.1% |
| D | 206496 | 7.5% |
| A | 205369 | 7.5% |
| N | 201752 | 7.3% |
| R | 198602 | 7.2% |
| I | 175681 | 6.4% |
| S | 153970 | 5.6% |
| T | 146526 | 5.3% |
| C | 123359 | 4.5% |
| Other values (17) | 810748 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2739006 | |
| Dash Punctuation | 13491 | 0.5% |
| Space Separator | 1286 | < 0.1% |
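The category names in this table match Unicode general categories. The sketch below shows one way such counts could be produced with Python's `unicodedata` module; it is not necessarily how the report was generated. The `CATEGORY_NAMES` mapping and the last two sample strings are illustrative assumptions, not values taken from the Sample table.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sm": "Math Symbol",
    "Zs": "Space Separator",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# First three values are from the Sample table; the last two are hypothetical.
sample = ["KIA", "BMW", "VOLVO", "LAND ROVER", "MERCEDES-BENZ"]
counts = category_counts(sample)
print(counts.most_common())
```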
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 280617 | 10.2% |
| E | 250663 | 9.2% |
| D | 206496 | 7.5% |
| A | 205369 | 7.5% |
| N | 201752 | 7.4% |
| R | 198602 | 7.3% |
| I | 175681 | 6.4% |
| S | 153970 | 5.6% |
| T | 146526 | 5.3% |
| C | 123359 | 4.5% |
| Other values (15) | 795971 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13491 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1286 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2739006 | |
| Common | 14777 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 280617 | 10.2% |
| E | 250663 | 9.2% |
| D | 206496 | 7.5% |
| A | 205369 | 7.5% |
| N | 201752 | 7.4% |
| R | 198602 | 7.3% |
| I | 175681 | 6.4% |
| S | 153970 | 5.6% |
| T | 146526 | 5.3% |
| C | 123359 | 4.5% |
| Other values (15) | 795971 |
Common
| Value | Count | Frequency (%) |
| - | 13491 | |
| (space) | 1286 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2753783 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 280617 | 10.2% |
| E | 250663 | 9.1% |
| D | 206496 | 7.5% |
| A | 205369 | 7.5% |
| N | 201752 | 7.3% |
| R | 198602 | 7.2% |
| I | 175681 | 6.4% |
| S | 153970 | 5.6% |
| T | 146526 | 5.3% |
| C | 123359 | 4.5% |
| Other values (17) | 810748 |
model
Text
| Distinct | 764 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.4 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 6.7515647 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3101311 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 28 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SORENTO |
|---|---|
| 2nd row | SORENTO |
| 3rd row | 3 SERIES |
| 4th row | S60 |
| 5th row | 6 SERIES GRAN COUPE |
| Value | Count | Frequency (%) |
| altima | 16192 | 2.9% |
| series | 12888 | 2.3% |
| fusion | 12516 | 2.2% |
| grand | 12226 | 2.2% |
| 1500 | 12169 | 2.2% |
| camry | 11651 | 2.1% |
| f-150 | 11561 | 2.1% |
| escape | 10542 | 1.9% |
| focus | 9403 | 1.7% |
| g | 8594 | 1.5% |
| Other values (669) | 442662 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 358099 | 11.5% |
| E | 272660 | 8.8% |
| R | 255538 | 8.2% |
| S | 212136 | 6.8% |
| O | 174142 | 5.6% |
| C | 170126 | 5.5% |
| I | 156903 | 5.1% |
| N | 156716 | 5.1% |
| T | 143788 | 4.6% |
| L | 128808 | 4.2% |
| Other values (29) | 1072395 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2751419 | |
| Decimal Number | 211659 | 6.8% |
| Space Separator | 101057 | 3.3% |
| Dash Punctuation | 37082 | 1.2% |
| Other Punctuation | 94 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 358099 | |
| E | 272660 | 9.9% |
| R | 255538 | 9.3% |
| S | 212136 | 7.7% |
| O | 174142 | 6.3% |
| C | 170126 | 6.2% |
| I | 156903 | 5.7% |
| N | 156716 | 5.7% |
| T | 143788 | 5.2% |
| L | 128808 | 4.7% |
| Other values (16) | 722503 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 78170 | |
| 5 | 48314 | |
| 3 | 25898 | 12.2% |
| 1 | 24973 | 11.8% |
| 2 | 11968 | 5.7% |
| 4 | 9242 | 4.4% |
| 6 | 7213 | 3.4% |
| 7 | 3218 | 1.5% |
| 9 | 1908 | 0.9% |
| 8 | 755 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 101057 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37082 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 94 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2751419 | |
| Common | 349892 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 358099 | |
| E | 272660 | 9.9% |
| R | 255538 | 9.3% |
| S | 212136 | 7.7% |
| O | 174142 | 6.3% |
| C | 170126 | 6.2% |
| I | 156903 | 5.7% |
| N | 156716 | 5.7% |
| T | 143788 | 5.2% |
| L | 128808 | 4.7% |
| Other values (16) | 722503 |
Common
| Value | Count | Frequency (%) |
| (space) | 101057 | |
| 0 | 78170 | |
| 5 | 48314 | |
| - | 37082 | 10.6% |
| 3 | 25898 | 7.4% |
| 1 | 24973 | 7.1% |
| 2 | 11968 | 3.4% |
| 4 | 9242 | 2.6% |
| 6 | 7213 | 2.1% |
| 7 | 3218 | 0.9% |
| Other values (3) | 2757 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3101311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 358099 | 11.5% |
| E | 272660 | 8.8% |
| R | 255538 | 8.2% |
| S | 212136 | 6.8% |
| O | 174142 | 5.6% |
| C | 170126 | 5.5% |
| I | 156903 | 5.1% |
| N | 156716 | 5.1% |
| T | 143788 | 4.6% |
| L | 128808 | 4.2% |
| Other values (29) | 1072395 |
trim
Text
| Distinct | 1474 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 37 |
| Mean length | 4.6401021 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2131417 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 107 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LX |
|---|---|
| 2nd row | LX |
| 3rd row | 328I SULEV |
| 4th row | T5 |
| 5th row | 650I |
| Value | Count | Frequency (%) |
| base | 47437 | 8.5% |
| se | 42514 | 7.6% |
| s | 24910 | 4.5% |
| lx | 18888 | 3.4% |
| lt | 17804 | 3.2% |
| limited | 16840 | 3.0% |
| 2.5 | 15720 | 2.8% |
| ls | 15529 | 2.8% |
| xlt | 14744 | 2.6% |
| sport | 14573 | 2.6% |
| Other values (840) | 328638 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 236801 | 11.1% |
| S | 236432 | 11.1% |
| L | 200284 | 9.4% |
| T | 165312 | 7.8% |
| I | 116773 | 5.5% |
| (space) | 98250 | 4.6% |
| A | 96253 | 4.5% |
| X | 90572 | 4.2% |
| R | 89134 | 4.2% |
| 5 | 59231 | 2.8% |
| Other values (36) | 742375 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1728110 | |
| Decimal Number | 244078 | 11.5% |
| Space Separator | 98250 | 4.6% |
| Other Punctuation | 42376 | 2.0% |
| Dash Punctuation | 16907 | 0.8% |
| Math Symbol | 1592 | 0.1% |
| Open Punctuation | 52 | < 0.1% |
| Close Punctuation | 52 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 236801 | |
| S | 236432 | |
| L | 200284 | |
| T | 165312 | |
| I | 116773 | 6.8% |
| A | 96253 | 5.6% |
| X | 90572 | 5.2% |
| R | 89134 | 5.2% |
| U | 53561 | 3.1% |
| B | 52521 | 3.0% |
| Other values (16) | 390467 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 59231 | |
| 2 | 46618 | |
| 3 | 41462 | |
| 0 | 37815 | |
| 1 | 15093 | 6.2% |
| 7 | 11646 | 4.8% |
| 8 | 11459 | 4.7% |
| 6 | 10651 | 4.4% |
| 4 | 9848 | 4.0% |
| 9 | 255 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39882 | |
| / | 2077 | 4.9% |
| ! | 383 | 0.9% |
| ' | 28 | 0.1% |
| : | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 98250 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16907 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1592 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 52 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 52 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1728110 | |
| Common | 403307 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 236801 | |
| S | 236432 | |
| L | 200284 | |
| T | 165312 | |
| I | 116773 | 6.8% |
| A | 96253 | 5.6% |
| X | 90572 | 5.2% |
| R | 89134 | 5.2% |
| U | 53561 | 3.1% |
| B | 52521 | 3.0% |
| Other values (16) | 390467 |
Common
| Value | Count | Frequency (%) |
| (space) | 98250 | |
| 5 | 59231 | |
| 2 | 46618 | |
| 3 | 41462 | |
| . | 39882 | |
| 0 | 37815 | 9.4% |
| - | 16907 | 4.2% |
| 1 | 15093 | 3.7% |
| 7 | 11646 | 2.9% |
| 8 | 11459 | 2.8% |
| Other values (10) | 24944 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2131417 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 236801 | 11.1% |
| S | 236432 | 11.1% |
| L | 200284 | 9.4% |
| T | 165312 | 7.8% |
| I | 116773 | 5.5% |
| (space) | 98250 | 4.6% |
| A | 96253 | 4.5% |
| X | 90572 | 4.2% |
| R | 89134 | 4.2% |
| 5 | 59231 | 2.8% |
| Other values (36) | 742375 |
body
Categorical
IMBALANCE 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.8 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 5 |
| Mean length | 5.2971174 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2433215 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SUV |
|---|---|
| 2nd row | SUV |
| 3rd row | SEDAN |
| 4th row | SEDAN |
| 5th row | SEDAN |
Common Values
| Value | Count | Frequency (%) |
| SEDAN | 206429 | |
| SUV | 117229 | |
| HATCHBACK | 23184 | 5.0% |
| MINIVAN | 21429 | 4.7% |
| COUPE | 15357 | 3.3% |
| WAGON | 13816 | 3.0% |
| CREW CAB | 13627 | 3.0% |
| CONVERTIBLE | 8935 | 1.9% |
| SUPERCREW | 7278 | 1.6% |
| G SEDAN | 6812 | 1.5% |
| Other values (35) | 25251 | 5.5% |
Length
| Value | Count | Frequency (%) |
| sedan | 213241 | |
| suv | 117229 | |
| cab | 27572 | 5.5% |
| hatchback | 23184 | 4.7% |
| minivan | 21429 | 4.3% |
| coupe | 17356 | 3.5% |
| wagon | 13856 | 2.8% |
| crew | 13627 | 2.7% |
| convertible | 9357 | 1.9% |
| g | 8594 | 1.7% |
| Other values (32) | 32014 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 345276 | |
| A | 339653 | |
| E | 303283 | |
| N | 288531 | |
| D | 225679 | |
| U | 154964 | |
| V | 152657 | |
| C | 126946 | 5.2% |
| B | 65787 | 2.7% |
| I | 54009 | 2.2% |
| Other values (19) | 376430 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2393741 | |
| Space Separator | 38112 | 1.6% |
| Dash Punctuation | 1145 | < 0.1% |
| Decimal Number | 217 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 345276 | |
| A | 339653 | |
| E | 303283 | |
| N | 288531 | |
| D | 225679 | |
| U | 154964 | |
| V | 152657 | |
| C | 126946 | 5.3% |
| B | 65787 | 2.7% |
| I | 54009 | 2.3% |
| Other values (12) | 336956 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 76 | |
| 0 | 76 | |
| 3 | 30 | 13.8% |
| 7 | 30 | 13.8% |
| 4 | 5 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 38112 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1145 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2393741 | |
| Common | 39474 | 1.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 345276 | |
| A | 339653 | |
| E | 303283 | |
| N | 288531 | |
| D | 225679 | |
| U | 154964 | |
| V | 152657 | |
| C | 126946 | 5.3% |
| B | 65787 | 2.7% |
| I | 54009 | 2.3% |
| Other values (12) | 336956 |
Common
| Value | Count | Frequency (%) |
| (space) | 38112 | |
| - | 1145 | 2.9% |
| 6 | 76 | 0.2% |
| 0 | 76 | 0.2% |
| 3 | 30 | 0.1% |
| 7 | 30 | 0.1% |
| 4 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2433215 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 345276 | |
| A | 339653 | |
| E | 303283 | |
| N | 288531 | |
| D | 225679 | |
| U | 154964 | |
| V | 152657 | |
| C | 126946 | 5.2% |
| B | 65787 | 2.7% |
| I | 54009 | 2.2% |
| Other values (19) | 376430 |
transmission
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.8973913 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4086990 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AUTOMATIC |
|---|---|
| 2nd row | AUTOMATIC |
| 3rd row | AUTOMATIC |
| 4th row | AUTOMATIC |
| 5th row | AUTOMATIC |
Common Values
| Value | Count | Frequency (%) |
| AUTOMATIC | 443636 | 96.6% |
| MANUAL | 15711 | 3.4% |
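The 78.5% imbalance reported for transmission can be reproduced from these two counts if the score is taken to be one minus the normalized Shannon entropy of the class distribution. That definition is an assumption here, and `imbalance_score` is a hypothetical helper name; the sketch only shows that the assumption is consistent with the reported figure.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class
    shares and k is the number of classes. 0 means perfectly balanced,
    values near 1 mean one class dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is treated as not imbalanced
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts copied from the transmission Common Values table.
score = imbalance_score([443636, 15711])
print(f"{100 * score:.1f}%")
```

A perfectly balanced column scores 0 under this definition, for example `imbalance_score([50, 50])`.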
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| automatic | 443636 | |
| manual | 15711 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 918694 | |
| T | 887272 | |
| U | 459347 | |
| M | 459347 | |
| O | 443636 | |
| I | 443636 | |
| C | 443636 | |
| N | 15711 | 0.4% |
| L | 15711 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4086990 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 918694 | |
| T | 887272 | |
| U | 459347 | |
| M | 459347 | |
| O | 443636 | |
| I | 443636 | |
| C | 443636 | |
| N | 15711 | 0.4% |
| L | 15711 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4086990 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 918694 | |
| T | 887272 | |
| U | 459347 | |
| M | 459347 | |
| O | 443636 | |
| I | 443636 | |
| C | 443636 | |
| N | 15711 | 0.4% |
| L | 15711 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4086990 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 918694 | |
| T | 887272 | |
| U | 459347 | |
| M | 459347 | |
| O | 443636 | |
| I | 443636 | |
| C | 443636 | |
| N | 15711 | 0.4% |
| L | 15711 | 0.4% |
vin
Text
UNIQUE 
| Distinct | 459347 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.9 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Characters and Unicode
| Total characters | 7808899 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 459347 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5xyktca69fg566472 |
|---|---|
| 2nd row | 5xyktca69fg561319 |
| 3rd row | wba3c1c51ek116351 |
| 4th row | yv1612tb4f1310987 |
| 5th row | wba6b2c57ed129731 |
| Value | Count | Frequency (%) |
| 5xyktca69fg566472 | 1 | < 0.1% |
| knagm4ad8d5056639 | 1 | < 0.1% |
| yv1612tb4f1310987 | 1 | < 0.1% |
| wba6b2c57ed129731 | 1 | < 0.1% |
| 1n4al3ap1fn326013 | 1 | < 0.1% |
| wbsfv9c51ed593089 | 1 | < 0.1% |
| 1g1pc5sb2e7128460 | 1 | < 0.1% |
| wauffafl3en030343 | 1 | < 0.1% |
| 2g1fb3d37e9218789 | 1 | < 0.1% |
| wauhgafc0en062916 | 1 | < 0.1% |
| Other values (459337) | 459337 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 757751 | 9.7% |
| 2 | 523213 | 6.7% |
| 3 | 504651 | 6.5% |
| 5 | 489867 | 6.3% |
| 4 | 468123 | 6.0% |
| 0 | 409790 | 5.2% |
| 6 | 399840 | 5.1% |
| 7 | 376083 | 4.8% |
| 8 | 370564 | 4.7% |
| c | 317882 | 4.1% |
| Other values (23) | 3191135 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4609856 | |
| Lowercase Letter | 3199043 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 317882 | 9.9% |
| a | 296915 | 9.3% |
| d | 236327 | 7.4% |
| f | 233161 | 7.3% |
| b | 220594 | 6.9% |
| e | 201666 | 6.3% |
| g | 193397 | 6.0% |
| n | 157669 | 4.9% |
| k | 132047 | 4.1% |
| h | 125435 | 3.9% |
| Other values (13) | 1083950 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 757751 | |
| 2 | 523213 | |
| 3 | 504651 | |
| 5 | 489867 | |
| 4 | 468123 | |
| 0 | 409790 | |
| 6 | 399840 | |
| 7 | 376083 | |
| 8 | 370564 | |
| 9 | 309974 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4609856 | |
| Latin | 3199043 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 317882 | 9.9% |
| a | 296915 | 9.3% |
| d | 236327 | 7.4% |
| f | 233161 | 7.3% |
| b | 220594 | 6.9% |
| e | 201666 | 6.3% |
| g | 193397 | 6.0% |
| n | 157669 | 4.9% |
| k | 132047 | 4.1% |
| h | 125435 | 3.9% |
| Other values (13) | 1083950 |
Common
| Value | Count | Frequency (%) |
| 1 | 757751 | |
| 2 | 523213 | |
| 3 | 504651 | |
| 5 | 489867 | |
| 4 | 468123 | |
| 0 | 409790 | |
| 6 | 399840 | |
| 7 | 376083 | |
| 8 | 370564 | |
| 9 | 309974 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7808899 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 757751 | 9.7% |
| 2 | 523213 | 6.7% |
| 3 | 504651 | 6.5% |
| 5 | 489867 | 6.3% |
| 4 | 468123 | 6.0% |
| 0 | 409790 | 5.2% |
| 6 | 399840 | 5.1% |
| 7 | 376083 | 4.8% |
| 8 | 370564 | 4.7% |
| c | 317882 | 4.1% |
| Other values (23) | 3191135 |
state
Categorical
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 918694 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CA |
|---|---|
| 2nd row | CA |
| 3rd row | CA |
| 4th row | CA |
| 5th row | CA |
Common Values
| Value | Count | Frequency (%) |
| FL | 73405 | |
| CA | 63579 | |
| TX | 40393 | 8.8% |
| GA | 29761 | 6.5% |
| PA | 23522 | 5.1% |
| NJ | 22578 | 4.9% |
| IL | 21144 | 4.6% |
| OH | 19771 | 4.3% |
| TN | 18476 | 4.0% |
| NC | 18040 | 3.9% |
| Other values (24) | 128678 |
Length
| Value | Count | Frequency (%) |
| fl | 73405 | |
| ca | 63579 | |
| tx | 40393 | 8.8% |
| ga | 29761 | 6.5% |
| pa | 23522 | 5.1% |
| nj | 22578 | 4.9% |
| il | 21144 | 4.6% |
| oh | 19771 | 4.3% |
| tn | 18476 | 4.0% |
| nc | 18040 | 3.9% |
| Other values (24) | 128678 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 148225 | |
| L | 96291 | |
| C | 91505 | |
| N | 91012 | |
| F | 73405 | |
| T | 60550 | 6.6% |
| M | 54874 | 6.0% |
| I | 49027 | 5.3% |
| O | 41983 | 4.6% |
| X | 40393 | 4.4% |
| Other values (14) | 171429 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 918694 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 148225 | |
| L | 96291 | |
| C | 91505 | |
| N | 91012 | |
| F | 73405 | |
| T | 60550 | 6.6% |
| M | 54874 | 6.0% |
| I | 49027 | 5.3% |
| O | 41983 | 4.6% |
| X | 40393 | 4.4% |
| Other values (14) | 171429 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 918694 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 148225 | |
| L | 96291 | |
| C | 91505 | |
| N | 91012 | |
| F | 73405 | |
| T | 60550 | 6.6% |
| M | 54874 | 6.0% |
| I | 49027 | 5.3% |
| O | 41983 | 4.6% |
| X | 40393 | 4.4% |
| Other values (14) | 171429 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 918694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 148225 | |
| L | 96291 | |
| C | 91505 | |
| N | 91012 | |
| F | 73405 | |
| T | 60550 | 6.6% |
| M | 54874 | 6.0% |
| I | 49027 | 5.3% |
| O | 41983 | 4.6% |
| X | 40393 | 4.4% |
| Other values (14) | 171429 |
condition
Real number (ℝ)
| Distinct | 41 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.787189 |

| Minimum | 1 |
|---|---|
| Maximum | 49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 24 |
| median | 35 |
| Q3 | 42 |
| 95-th percentile | 47 |
| Maximum | 49 |
| Range | 48 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 13.307967 |
|---|---|
| Coefficient of variation (CV) | 0.43225663 |
| Kurtosis | -0.18702975 |
| Mean | 30.787189 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.8428269 |
| Sum | 14142003 |
| Variance | 177.10198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 35669 | 7.8% |
| 35 | 22454 | 4.9% |
| 37 | 21986 | 4.8% |
| 44 | 21579 | 4.7% |
| 43 | 21095 | 4.6% |
| 42 | 20604 | 4.5% |
| 36 | 19571 | 4.3% |
| 41 | 19305 | 4.2% |
| 39 | 17119 | 3.7% |
| 2 | 16741 | 3.6% |
| Other values (31) | 243224 |
| Value | Count | Frequency (%) |
| 1 | 5688 | 1.2% |
| 2 | 16741 | 3.6% |
| 3 | 8906 | 1.9% |
| 4 | 16737 | 3.6% |
| 5 | 9230 | 2.0% |
| 11 | 77 | < 0.1% |
| 12 | 82 | < 0.1% |
| 13 | 71 | < 0.1% |
| 14 | 111 | < 0.1% |
| 15 | 113 | < 0.1% |
| Value | Count | Frequency (%) |
| 49 | 10850 | 2.4% |
| 48 | 10683 | 2.3% |
| 47 | 9543 | 2.1% |
| 46 | 10594 | 2.3% |
| 45 | 10440 | 2.3% |
| 44 | 21579 | 4.7% |
| 43 | 21095 | 4.6% |
| 42 | 20604 | 4.5% |
| 41 | 19305 | 4.2% |
| 39 | 17119 | 3.7% |
odometer
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 159073 |
|---|---|
| Distinct (%) | 34.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66644.169 |

| Minimum | 1 |
|---|---|
| Maximum | 999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 27947.5 |
| median | 50683 |
| Q3 | 96674 |
| 95-th percentile | 166075.8 |
| Maximum | 999999 |
| Range | 999998 |
| Interquartile range (IQR) | 68726.5 |
Descriptive statistics
| Standard deviation | 52181.19 |
|---|---|
| Coefficient of variation (CV) | 0.78298207 |
| Kurtosis | 15.151132 |
| Mean | 66644.169 |
| Median Absolute Deviation (MAD) | 29180 |
| Skewness | 1.9277255 |
| Sum | 3.0612799 × 10¹⁰ |
| Variance | 2.7228766 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 845 | 0.2% |
| 999999 | 60 | < 0.1% |
| 10 | 24 | < 0.1% |
| 24023 | 17 | < 0.1% |
| 33995 | 17 | < 0.1% |
| 21587 | 17 | < 0.1% |
| 35888 | 16 | < 0.1% |
| 21310 | 16 | < 0.1% |
| 29850 | 16 | < 0.1% |
| 36265 | 16 | < 0.1% |
| Other values (159063) | 458303 |
| Value | Count | Frequency (%) |
| 1 | 845 | 0.2% |
| 2 | 10 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 7 | < 0.1% |
| 5 | 12 | < 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 9 | < 0.1% |
| 10 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 999999 | 60 | < 0.1% |
| 980113 | 1 | < 0.1% |
| 959276 | 1 | < 0.1% |
| 694978 | 2 | < 0.1% |
| 621388 | 1 | < 0.1% |
| 537334 | 1 | < 0.1% |
| 522212 | 1 | < 0.1% |
| 495757 | 1 | < 0.1% |
| 480747 | 1 | < 0.1% |
| 471114 | 1 | < 0.1% |
color
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.1 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 4.6142023 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2119520 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WHITE |
|---|---|
| 2nd row | WHITE |
| 3rd row | GRAY |
| 4th row | WHITE |
| 5th row | GRAY |
Common Values
| Value | Count | Frequency (%) |
| BLACK | 90446 | |
| WHITE | 86860 | |
| SILVER | 69167 | |
| GRAY | 68734 | |
| BLUE | 41985 | |
| RED | 36278 | |
| — | 21678 | 4.7% |
| GOLD | 9266 | 2.0% |
| GREEN | 9038 | 2.0% |
| BURGUNDY | 7402 | 1.6% |
| Other values (10) | 18493 | 4.0% |
Length
| Value | Count | Frequency (%) |
| black | 90446 | |
| white | 86860 | |
| silver | 69167 | |
| gray | 68734 | |
| blue | 41985 | |
| red | 36278 | |
| — | 21678 | 4.7% |
| gold | 9266 | 2.0% |
| green | 9038 | 2.0% |
| burgundy | 7402 | 1.6% |
| Other values (10) | 18493 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 272173 | |
| L | 214561 | 10.1% |
| R | 199605 | 9.4% |
| I | 164698 | 7.8% |
| A | 161704 | 7.6% |
| B | 152539 | 7.2% |
| G | 103368 | 4.9% |
| W | 94546 | 4.5% |
| C | 91280 | 4.3% |
| K | 90485 | 4.3% |
| Other values (15) | 574561 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2096645 | |
| Dash Punctuation | 22875 | 1.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 272173 | |
| L | 214561 | 10.2% |
| R | 199605 | 9.5% |
| I | 164698 | 7.9% |
| A | 161704 | 7.7% |
| B | 152539 | 7.3% |
| G | 103368 | 4.9% |
| W | 94546 | 4.5% |
| C | 91280 | 4.4% |
| K | 90485 | 4.3% |
| Other values (13) | 551686 |
Dash Punctuation
| Value | Count | Frequency (%) |
| — | 21678 | |
| - | 1197 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2096645 | |
| Common | 22875 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 272173 | |
| L | 214561 | 10.2% |
| R | 199605 | 9.5% |
| I | 164698 | 7.9% |
| A | 161704 | 7.7% |
| B | 152539 | 7.3% |
| G | 103368 | 4.9% |
| W | 94546 | 4.5% |
| C | 91280 | 4.4% |
| K | 90485 | 4.3% |
| Other values (13) | 551686 |
Common
| Value | Count | Frequency (%) |
| — | 21678 | |
| - | 1197 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2097842 | |
| Punctuation | 21678 | 1.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 272173 | |
| L | 214561 | 10.2% |
| R | 199605 | 9.5% |
| I | 164698 | 7.9% |
| A | 161704 | 7.7% |
| B | 152539 | 7.3% |
| G | 103368 | 4.9% |
| W | 94546 | 4.5% |
| C | 91280 | 4.4% |
| K | 90485 | 4.3% |
| Other values (14) | 552883 |
Punctuation
| Value | Count | Frequency (%) |
| — | 21678 |
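The color column reports a second block, Punctuation, with 21678 characters, which equals the count of the dash value in Common Values. Assuming that value is the single character U+2014, the sketch below shows why it lands outside ASCII while the hyphen in other values stays inside it. The `describe` helper is illustrative. Whether this dash stands in for a missing color is not stated in the report.

```python
import unicodedata

def describe(ch):
    """Summarize a single character: code point, Unicode name, category, ASCII flag."""
    return {
        "codepoint": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),
        "is_ascii": ord(ch) < 128,
    }

long_dash = describe("\u2014")  # the dash value seen in color and interior
hyphen = describe("-")          # the ASCII hyphen, as in OFF-WHITE

print(long_dash)
print(hyphen)
```

Both characters share the Dash Punctuation category, which is consistent with the report listing two dash characters under one category but two different blocks.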
interior
Categorical
IMBALANCE 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 4.4329037 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2036241 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BLACK |
|---|---|
| 2nd row | BEIGE |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | BLACK |
Common Values
| Value | Count | Frequency (%) |
| BLACK | 204136 | 44.4% |
| GRAY | 148477 | 32.3% |
| BEIGE | 49038 | 10.7% |
| TAN | 36652 | 8.0% |
| — | 9646 | 2.1% |
| BROWN | 6869 | 1.5% |
| RED | 1088 | 0.2% |
| SILVER | 953 | 0.2% |
| BLUE | 894 | 0.2% |
| OFF-WHITE | 350 | 0.1% |
| Other values (7) | 1244 | 0.3% |
Length
| Value | Count | Frequency (%) |
| black | 204136 | 44.4% |
| gray | 148477 | 32.3% |
| beige | 49038 | 10.7% |
| tan | 36652 | 8.0% |
| — | 9646 | 2.1% |
| brown | 6869 | 1.5% |
| red | 1088 | 0.2% |
| silver | 953 | 0.2% |
| blue | 894 | 0.2% |
| off-white | 350 | 0.1% |
| Other values (7) | 1244 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 389382 | 19.1% |
| B | 261088 | 12.8% |
| L | 206574 | 10.1% |
| C | 204136 | 10.0% |
| K | 204136 | 10.0% |
| G | 198258 | 9.7% |
| R | 158129 | 7.8% |
| Y | 148647 | 7.3% |
| E | 102375 | 5.0% |
| I | 50547 | 2.5% |
| Other values (13) | 112969 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2026245 | 99.5% |
| Dash Punctuation | 9996 | 0.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 389382 | 19.2% |
| B | 261088 | 12.9% |
| L | 206574 | 10.2% |
| C | 204136 | 10.1% |
| K | 204136 | 10.1% |
| G | 198258 | 9.8% |
| R | 158129 | 7.8% |
| Y | 148647 | 7.3% |
| E | 102375 | 5.1% |
| I | 50547 | 2.5% |
| Other values (11) | 102973 | 5.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| — | 9646 | 96.5% |
| - | 350 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2026245 | 99.5% |
| Common | 9996 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 389382 | 19.2% |
| B | 261088 | 12.9% |
| L | 206574 | 10.2% |
| C | 204136 | 10.1% |
| K | 204136 | 10.1% |
| G | 198258 | 9.8% |
| R | 158129 | 7.8% |
| Y | 148647 | 7.3% |
| E | 102375 | 5.1% |
| I | 50547 | 2.5% |
| Other values (11) | 102973 | 5.1% |
Common
| Value | Count | Frequency (%) |
| — | 9646 | 96.5% |
| - | 350 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2026595 | 99.5% |
| Punctuation | 9646 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 389382 | 19.2% |
| B | 261088 | 12.9% |
| L | 206574 | 10.2% |
| C | 204136 | 10.1% |
| K | 204136 | 10.1% |
| G | 198258 | 9.8% |
| R | 158129 | 7.8% |
| Y | 148647 | 7.3% |
| E | 102375 | 5.1% |
| I | 50547 | 2.5% |
| Other values (12) | 103323 | 5.1% |
Punctuation
| Value | Count | Frequency (%) |
| — | 9646 | 100.0% |
seller
Text
| Distinct | 11714 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.6 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 42 |
| Mean length | 23.058749 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10591967 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4076 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | KIA MOTORS AMERICA INC |
|---|---|
| 2nd row | KIA MOTORS AMERICA INC |
| 3rd row | FINANCIAL SERVICES REMARKETING (LEASE) |
| 4th row | VOLVO NA REP/WORLD OMNI |
| 5th row | FINANCIAL SERVICES REMARKETING (LEASE) |
| Value | Count | Frequency (%) |
| inc | 64357 | 4.2% |
| corporation | 42443 | 2.7% |
| credit | 41755 | 2.7% |
| services | 41236 | 2.7% |
| motor | 39756 | 2.6% |
| llc | 39159 | 2.5% |
| financial | 37352 | 2.4% |
| auto | 34572 | 2.2% |
| ford | 31378 | 2.0% |
| remarketing | 28461 | 1.8% |
| Other values (7362) | 1143076 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1099827 | 10.4% |
| E | 947144 | 8.9% |
| A | 857349 | 8.1% |
| R | 804906 | 7.6% |
| N | 783153 | 7.4% |
| I | 758418 | 7.2% |
| O | 719484 | 6.8% |
| T | 662700 | 6.3% |
| C | 608673 | 5.7% |
| S | 546262 | 5.2% |
| Other values (37) | 2804051 | 26.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9321535 | 88.0% |
| Space Separator | 1099827 | 10.4% |
| Other Punctuation | 124312 | 1.2% |
| Dash Punctuation | 28295 | 0.3% |
| Decimal Number | 7351 | 0.1% |
| Close Punctuation | 5319 | 0.1% |
| Open Punctuation | 5319 | 0.1% |
| Math Symbol | 9 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 947144 | 10.2% |
| A | 857349 | 9.2% |
| R | 804906 | 8.6% |
| N | 783153 | 8.4% |
| I | 758418 | 8.1% |
| O | 719484 | 7.7% |
| T | 662700 | 7.1% |
| C | 608673 | 6.5% |
| S | 546262 | 5.9% |
| L | 469207 | 5.0% |
| Other values (16) | 2164239 | 23.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2326 | 31.6% |
| 1 | 1566 | 21.3% |
| 0 | 1033 | 14.1% |
| 9 | 501 | 6.8% |
| 5 | 494 | 6.7% |
| 8 | 444 | 6.0% |
| 4 | 371 | 5.0% |
| 3 | 319 | 4.3% |
| 6 | 225 | 3.1% |
| 7 | 72 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 89404 | 71.9% |
| . | 23588 | 19.0% |
| & | 7413 | 6.0% |
| ' | 2248 | 1.8% |
| # | 1658 | 1.3% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1099827 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28295 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5319 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5319 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 9 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9321535 | 88.0% |
| Common | 1270432 | 12.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 947144 | 10.2% |
| A | 857349 | 9.2% |
| R | 804906 | 8.6% |
| N | 783153 | 8.4% |
| I | 758418 | 8.1% |
| O | 719484 | 7.7% |
| T | 662700 | 7.1% |
| C | 608673 | 6.5% |
| S | 546262 | 5.9% |
| L | 469207 | 5.0% |
| Other values (16) | 2164239 | 23.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 1099827 | 86.6% |
| / | 89404 | 7.0% |
| - | 28295 | 2.2% |
| . | 23588 | 1.9% |
| & | 7413 | 0.6% |
| ) | 5319 | 0.4% |
| ( | 5319 | 0.4% |
| 2 | 2326 | 0.2% |
| ' | 2248 | 0.2% |
| # | 1658 | 0.1% |
| Other values (11) | 5035 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10591967 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1099827 | 10.4% |
| E | 947144 | 8.9% |
| A | 857349 | 8.1% |
| R | 804906 | 7.6% |
| N | 783153 | 7.4% |
| I | 758418 | 7.2% |
| O | 719484 | 6.8% |
| T | 662700 | 6.3% |
| C | 608673 | 5.7% |
| S | 546262 | 5.2% |
| Other values (37) | 2804051 | 26.5% |
mmr
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1098 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13829.019 |
| Minimum | 25 |
|---|---|
| Maximum | 182000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 1950 |
| Q1 | 7425 |
| median | 12300 |
| Q3 | 18250 |
| 95-th percentile | 30500 |
| Maximum | 182000 |
| Range | 181975 |
| Interquartile range (IQR) | 10825 |
Descriptive statistics
| Standard deviation | 9541.0782 |
|---|---|
| Coefficient of variation (CV) | 0.68993168 |
| Kurtosis | 12.352817 |
| Mean | 13829.019 |
| Median Absolute Deviation (MAD) | 5400 |
| Skewness | 2.0512269 |
| Sum | 6.3523183 × 10⁹ |
| Variance | 91032172 |
| Monotonicity | Not monotonic |
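The derived figures in the mmr tables above (range, IQR, coefficient of variation, variance) follow from the reported minimum, maximum, quartiles, mean, and standard deviation. A minimal Python sketch that re-derives them from the rounded values printed in this report; the variable names are illustrative, and small differences from the reported variance are expected because the inputs are already rounded:

```python
# Values copied from the mmr tables in this report (already rounded).
mean = 13829.019
std = 9541.0782
q1, q3 = 7425, 18250
minimum, maximum = 25, 182000

# Derived statistics, using their standard definitions.
value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance is the squared standard deviation

print(f"Range    = {value_range}")
print(f"IQR      = {iqr}")
print(f"CV       = {cv:.6f}")
print(f"Variance = {variance:.0f}")
```

The same check applies to the sellingprice tables by swapping in that variable's reported values.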
| Value | Count | Frequency (%) |
| 11650 | 1505 | 0.3% |
| 12500 | 1487 | 0.3% |
| 11600 | 1486 | 0.3% |
| 11750 | 1478 | 0.3% |
| 11850 | 1476 | 0.3% |
| 11300 | 1473 | 0.3% |
| 12700 | 1455 | 0.3% |
| 12050 | 1453 | 0.3% |
| 11050 | 1450 | 0.3% |
| 12350 | 1448 | 0.3% |
| Other values (1088) | 444636 | 96.8% |
| Value | Count | Frequency (%) |
| 25 | 16 | < 0.1% |
| 50 | 38 | < 0.1% |
| 75 | 18 | < 0.1% |
| 100 | 25 | < 0.1% |
| 125 | 21 | < 0.1% |
| 150 | 32 | < 0.1% |
| 175 | 44 | < 0.1% |
| 200 | 40 | < 0.1% |
| 225 | 35 | < 0.1% |
| 250 | 61 | < 0.1% |
| Value | Count | Frequency (%) |
| 182000 | 1 | < 0.1% |
| 178000 | 1 | < 0.1% |
| 176000 | 1 | < 0.1% |
| 170000 | 3 | < 0.1% |
| 166000 | 2 | < 0.1% |
| 164000 | 1 | < 0.1% |
| 163000 | 1 | < 0.1% |
| 162000 | 1 | < 0.1% |
| 161000 | 1 | < 0.1% |
| 160000 | 2 | < 0.1% |
sellingprice
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1785 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13687.786 |
| Minimum | 1 |
|---|---|
| Maximum | 230000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1600 |
| Q1 | 7200 |
| median | 12200 |
| Q3 | 18200 |
| 95-th percentile | 30500 |
| Maximum | 230000 |
| Range | 229999 |
| Interquartile range (IQR) | 11000 |
Descriptive statistics
| Standard deviation | 9620.281 |
|---|---|
| Coefficient of variation (CV) | 0.70283691 |
| Kurtosis | 12.080594 |
| Mean | 13687.786 |
| Median Absolute Deviation (MAD) | 5500 |
| Skewness | 2.0054681 |
| Sum | 6.2874433 × 10⁹ |
| Variance | 92549807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12000 | 3782 | 0.8% |
| 11000 | 3723 | 0.8% |
| 13000 | 3687 | 0.8% |
| 10000 | 3438 | 0.7% |
| 11500 | 3347 | 0.7% |
| 14000 | 3242 | 0.7% |
| 12500 | 3175 | 0.7% |
| 9000 | 3097 | 0.7% |
| 10500 | 2994 | 0.7% |
| 9500 | 2846 | 0.6% |
| Other values (1775) | 426016 | 92.7% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 100 | 13 | < 0.1% |
| 150 | 16 | < 0.1% |
| 175 | 7 | < 0.1% |
| 200 | 130 | < 0.1% |
| 225 | 79 | < 0.1% |
| 250 | 211 | < 0.1% |
| 275 | 88 | < 0.1% |
| 300 | 927 | 0.2% |
| 325 | 151 | < 0.1% |
| Value | Count | Frequency (%) |
| 230000 | 1 | < 0.1% |
| 183000 | 1 | < 0.1% |
| 173000 | 1 | < 0.1% |
| 171500 | 1 | < 0.1% |
| 169500 | 1 | < 0.1% |
| 169000 | 1 | < 0.1% |
| 167000 | 1 | < 0.1% |
| 165000 | 2 | < 0.1% |
| 163000 | 2 | < 0.1% |
| 161000 | 1 | < 0.1% |
saledate
Date
| Distinct | 3585 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.0 MiB |
| Minimum | 2014-01-01 01:15:00 |
|---|---|
| Maximum | 2015-07-20 19:30:00 |
| | Unnamed: 0 | body | color | condition | interior | mmr | odometer | sellingprice | state | transmission | year |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.019 | 0.013 | 0.027 | 0.013 | 0.036 | -0.020 | 0.042 | 0.073 | 0.012 | 0.067 |
| body | 0.019 | 1.000 | 0.071 | 0.062 | 0.073 | 0.149 | 0.066 | 0.129 | 0.049 | 0.234 | 0.087 |
| color | 0.013 | 0.071 | 1.000 | 0.057 | 0.093 | 0.055 | 0.065 | 0.048 | 0.067 | 0.081 | 0.092 |
| condition | 0.027 | 0.062 | 0.057 | 1.000 | 0.053 | 0.424 | -0.407 | 0.478 | 0.086 | 0.033 | 0.388 |
| interior | 0.013 | 0.073 | 0.093 | 0.053 | 1.000 | 0.061 | 0.085 | 0.061 | 0.059 | 0.077 | 0.104 |
| mmr | 0.036 | 0.149 | 0.055 | 0.424 | 0.061 | 1.000 | -0.713 | 0.980 | 0.061 | 0.029 | 0.686 |
| odometer | -0.020 | 0.066 | 0.065 | -0.407 | 0.085 | -0.713 | 1.000 | -0.701 | 0.088 | 0.032 | -0.812 |
| sellingprice | 0.042 | 0.129 | 0.048 | 0.478 | 0.061 | 0.980 | -0.701 | 1.000 | 0.050 | 0.015 | 0.669 |
| state | 0.073 | 0.049 | 0.067 | 0.086 | 0.059 | 0.061 | 0.088 | 0.050 | 1.000 | 0.078 | 0.096 |
| transmission | 0.012 | 0.234 | 0.081 | 0.033 | 0.077 | 0.029 | 0.032 | 0.015 | 0.078 | 1.000 | 0.092 |
| year | 0.067 | 0.087 | 0.092 | 0.388 | 0.104 | 0.686 | -0.812 | 0.669 | 0.096 | 0.092 | 1.000 |
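The High correlation alerts for mmr, odometer, sellingprice, and year can be cross-checked against the matrix above. The sketch below copies the matrix values and lists, for each variable, the other variables whose absolute coefficient meets a cutoff. The 0.5 cutoff and the helper name `high_correlations` are assumptions made for illustration, not settings read from this report's configuration.

```python
# Column order and coefficients copied from the correlation matrix above.
columns = ["Unnamed: 0", "body", "color", "condition", "interior", "mmr",
           "odometer", "sellingprice", "state", "transmission", "year"]
matrix = [
    [1.000, 0.019, 0.013, 0.027, 0.013, 0.036, -0.020, 0.042, 0.073, 0.012, 0.067],
    [0.019, 1.000, 0.071, 0.062, 0.073, 0.149, 0.066, 0.129, 0.049, 0.234, 0.087],
    [0.013, 0.071, 1.000, 0.057, 0.093, 0.055, 0.065, 0.048, 0.067, 0.081, 0.092],
    [0.027, 0.062, 0.057, 1.000, 0.053, 0.424, -0.407, 0.478, 0.086, 0.033, 0.388],
    [0.013, 0.073, 0.093, 0.053, 1.000, 0.061, 0.085, 0.061, 0.059, 0.077, 0.104],
    [0.036, 0.149, 0.055, 0.424, 0.061, 1.000, -0.713, 0.980, 0.061, 0.029, 0.686],
    [-0.020, 0.066, 0.065, -0.407, 0.085, -0.713, 1.000, -0.701, 0.088, 0.032, -0.812],
    [0.042, 0.129, 0.048, 0.478, 0.061, 0.980, -0.701, 1.000, 0.050, 0.015, 0.669],
    [0.073, 0.049, 0.067, 0.086, 0.059, 0.061, 0.088, 0.050, 1.000, 0.078, 0.096],
    [0.012, 0.234, 0.081, 0.033, 0.077, 0.029, 0.032, 0.015, 0.078, 1.000, 0.092],
    [0.067, 0.087, 0.092, 0.388, 0.104, 0.686, -0.812, 0.669, 0.096, 0.092, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, chosen for illustration


def high_correlations(names, rows, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for i, name in enumerate(names):
        partners = [names[j] for j, r in enumerate(rows[i])
                    if j != i and abs(r) >= threshold]
        if partners:
            flagged[name] = partners
    return flagged


flagged = high_correlations(columns, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(f"{name}: {', '.join(partners)}")
```

With this cutoff, only mmr, odometer, sellingprice, and year are flagged, each with three partners, which matches the four alerts listed in the report overview.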
| | Unnamed: 0 | year | make | model | trim | body | transmission | vin | state | condition | odometer | color | interior | seller | mmr | sellingprice | saledate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2015 | KIA | SORENTO | LX | SUV | AUTOMATIC | 5xyktca69fg566472 | CA | 5.0 | 16639.0 | WHITE | BLACK | KIA MOTORS AMERICA INC | 20500.0 | 21500.0 | 2014-12-16 04:30:00 |
| 1 | 1 | 2015 | KIA | SORENTO | LX | SUV | AUTOMATIC | 5xyktca69fg561319 | CA | 5.0 | 9393.0 | WHITE | BEIGE | KIA MOTORS AMERICA INC | 20800.0 | 21500.0 | 2014-12-16 04:30:00 |
| 2 | 2 | 2014 | BMW | 3 SERIES | 328I SULEV | SEDAN | AUTOMATIC | wba3c1c51ek116351 | CA | 45.0 | 1331.0 | GRAY | BLACK | FINANCIAL SERVICES REMARKETING (LEASE) | 31900.0 | 30000.0 | 2015-01-14 20:30:00 |
| 3 | 3 | 2015 | VOLVO | S60 | T5 | SEDAN | AUTOMATIC | yv1612tb4f1310987 | CA | 41.0 | 14282.0 | WHITE | BLACK | VOLVO NA REP/WORLD OMNI | 27500.0 | 27750.0 | 2015-01-28 20:30:00 |
| 4 | 4 | 2014 | BMW | 6 SERIES GRAN COUPE | 650I | SEDAN | AUTOMATIC | wba6b2c57ed129731 | CA | 43.0 | 2641.0 | GRAY | BLACK | FINANCIAL SERVICES REMARKETING (LEASE) | 66000.0 | 67000.0 | 2014-12-18 04:30:00 |
| 5 | 5 | 2015 | NISSAN | ALTIMA | 2.5 S | SEDAN | AUTOMATIC | 1n4al3ap1fn326013 | CA | 1.0 | 5554.0 | GRAY | BLACK | ENTERPRISE VEHICLE EXCHANGE / TRA / RENTAL / TULSA | 15350.0 | 10900.0 | 2014-12-30 04:00:00 |
| 6 | 6 | 2014 | BMW | M5 | BASE | SEDAN | AUTOMATIC | wbsfv9c51ed593089 | CA | 34.0 | 14943.0 | BLACK | BLACK | THE HERTZ CORPORATION | 69000.0 | 65000.0 | 2014-12-17 04:30:00 |
| 7 | 7 | 2014 | CHEVROLET | CRUZE | 1LT | SEDAN | AUTOMATIC | 1g1pc5sb2e7128460 | CA | 2.0 | 28617.0 | BLACK | BLACK | ENTERPRISE VEHICLE EXCHANGE / TRA / RENTAL / TULSA | 11900.0 | 9800.0 | 2014-12-16 05:00:00 |
| 8 | 8 | 2014 | AUDI | A4 | 2.0T PREMIUM PLUS QUATTRO | SEDAN | AUTOMATIC | wauffafl3en030343 | CA | 42.0 | 9557.0 | WHITE | BLACK | AUDI MISSION VIEJO | 32100.0 | 32250.0 | 2014-12-18 04:00:00 |
| 9 | 9 | 2014 | CHEVROLET | CAMARO | LT | CONVERTIBLE | AUTOMATIC | 2g1fb3d37e9218789 | CA | 3.0 | 4809.0 | RED | BLACK | D/M AUTO SALES INC | 26300.0 | 17500.0 | 2015-01-19 20:00:00 |
| | Unnamed: 0 | year | make | model | trim | body | transmission | vin | state | condition | odometer | color | interior | seller | mmr | sellingprice | saledate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 558824 | 558824 | 2013 | AUDI | S5 | PREMIUM PLUS QUATTRO | CONVERTIBLE | AUTOMATIC | waucgafh6dn005382 | FL | 5.0 | 20158.0 | SILVER | BLACK | PRESTIGE AUDI | 43900.0 | 42000.0 | 2015-07-08 23:00:00 |
| 558825 | 558825 | 2011 | SUBARU | FORESTER | 2.5X | SUV | MANUAL | jf2shbac9bg741815 | CA | 41.0 | 71693.0 | SILVER | BLACK | REMARKETING BY GE/BILLION DODGE | 12300.0 | 11750.0 | 2015-07-08 02:30:00 |
| 558826 | 558826 | 2014 | JEEP | GRAND CHEROKEE | LIMITED | SUV | AUTOMATIC | 1c4rjebg4ec573100 | CA | 44.0 | 9024.0 | GRAY | BLACK | ENTERPRISE VEHICLE EXCHANGE / TRA / RENTAL / TULSA | 29800.0 | 17300.0 | 2015-07-09 02:00:00 |
| 558827 | 558827 | 2014 | JEEP | GRAND CHEROKEE | LAREDO | SUV | AUTOMATIC | 1c4rjfag0ec466276 | PA | 42.0 | 25180.0 | GRAY | BLACK | HERTZ CORPORATION/GDP | 26000.0 | 24500.0 | 2015-07-06 23:30:00 |
| 558828 | 558828 | 2012 | DODGE | GRAND CARAVAN | AMERICAN VALUE PACKAGE | MINIVAN | AUTOMATIC | 2c4rdgbg1cr349287 | MA | 37.0 | 97036.0 | SILVER | GRAY | GE FLEET SERVICES FOR ITSELF/SERVICER | 8300.0 | 7800.0 | 2015-07-06 23:30:00 |
| 558831 | 558831 | 2011 | BMW | 5 SERIES | 528I | SEDAN | AUTOMATIC | wbafr1c53bc744672 | FL | 39.0 | 66403.0 | WHITE | BROWN | LAUDERDALE IMPORTS LTD BMW PEMBROK PINES | 20300.0 | 22800.0 | 2015-07-06 23:15:00 |
| 558833 | 558833 | 2012 | RAM | 2500 | POWER WAGON | CREW CAB | AUTOMATIC | 3c6td5et6cg112407 | WA | 5.0 | 54393.0 | WHITE | BLACK | I -5 UHLMANN RV | 30200.0 | 30800.0 | 2015-07-08 02:30:00 |
| 558834 | 558834 | 2012 | BMW | X5 | XDRIVE35D | SUV | AUTOMATIC | 5uxzw0c58cl668465 | CA | 48.0 | 50561.0 | BLACK | BLACK | FINANCIAL SERVICES REMARKETING (LEASE) | 29800.0 | 34000.0 | 2015-07-08 02:30:00 |
| 558835 | 558835 | 2015 | NISSAN | ALTIMA | 2.5 S | SEDAN | AUTOMATIC | 1n4al3ap0fc216050 | GA | 38.0 | 16658.0 | WHITE | BLACK | ENTERPRISE VEHICLE EXCHANGE / TRA / RENTAL / TULSA | 15100.0 | 11100.0 | 2015-07-08 23:45:00 |
| 558836 | 558836 | 2014 | FORD | F-150 | XLT | SUPERCREW | AUTOMATIC | 1ftfw1et2eke87277 | CA | 34.0 | 15008.0 | GRAY | GRAY | FORD MOTOR CREDIT COMPANY LLC PD | 29600.0 | 26700.0 | 2015-05-27 22:30:00 |